Atomic Broadcast in Asynchronous Crash-Recovery Distributed Systems
نویسندگان
چکیده
Atomic Broadcast is a fundamental problem of distributed systems: it states that messages must be delivered in the same order to their destination processes. This paper describes a solution to this problem in asynchronous distributed systems in which processes can crash and recover. A Consensus-based solution to Atomic Broadcast problem has been designed by Chandra and Toueg for asynchronous distributed systems where crashed processes do not recover. Although our solution is based on di erent algorithmic principles, it follows the same approach: it transforms any Consensus protocol suited to the crashrecovery model into an Atomic Broadcast protocol suited to the same model. We show that Atomic Broadcast can be implemented without requiring any additional log operations in excess of those required by the Consensus. The paper also discusses how additional log operations can improve the protocol in terms of faster recovery and better throughput. It is interesting to note that our work bene ts from recent results in the study of the Consensus problem in the crash-recovery model.
منابع مشابه
Atomic Broadcast in Asynchronous Crash-Recovery Distributed Systems and Its Use in Quorum-Based Replication
Atomic Broadcast is a fundamental problem of distributed systems: It states that messages must be delivered in the same order to their destination processes. This paper describes a solution to this problem in asynchronous distributed systems in which processes can crash and recover. A Consensus-based solution to Atomic Broadcast problem has been designed by Chandra and Toueg for asynchronous di...
متن کاملQuorum-Based Replication in Asynchronous Crash-Recovery Distributed Systems (Research Note)
This paper describes a solution to the replica management problem in asynchronous distributed systems in which processes can crash and recover. Our solution is based on a Atomic Broadcast primitive which, in turn, is based on an underlying Consensus algorithm. The proposed technique makes a bridge between established results on Weighted Voting and recent results on the Consensus problem.
متن کاملReal-Time Fault-Tolerant Atomic Broadcast
We present algorithms for Real-Time Fault-Tolerance Uniform Atomic Broadcast developed in the framework of the French project ATR (accord temps réel). We first design a distributed execution model for asynchronous systems with crash failure we called Synchronized Phase System (SPS), then we give an algorithm for Atomic Broadcast in SPS. In a SPS, the processes try to run in synchronized rounds ...
متن کاملGroup, Majority, and Strict Agreement in Timed Asynchronous Distributed Systems
Atomic broadcast is a group communication service that enables a team of distributed processes to keep replicated datàconsistent', despite concurrency, communication uncertainty, failures and recoveries. We investigate possible meanings for replicated datàconsistency' in timed asynchronous systems, subject to crash/performance process failures and omis-sion/performance communication failures wh...
متن کاملConsensus: The Big Misunderstanding
The paper aims at clarifying some misunderstandings about the consensus problem. These misunderstandings prevent consensus from being considered as it should be, i.e., a fundamental paradigm in the context of fault-tolerant distributed systems, not only from a theoretical point of view, but also from a practical point of view. Six frequent misunderstandings are discussed. Misunderstanding 1: Co...
متن کامل